Exponentially weighted moving average charts for detecting concept drift
نویسندگان
چکیده
.Classifying streaming data requires the development of methods which are computationally efficient and able to cope with changes in the underlying distribution of the stream, a phenomenon known in the literature as concept drift. We propose a new method for detecting concept drift which uses an Exponentially Weighted Moving Average (EWMA) chart to monitor the misclassification rate of an streaming classifier. Our approach is modular and can hence be run in parallel with any underlying classifier to provide an additional layer of concept drift detection. Moreover our method is computationally efficient with overhead O(1) and works in a fully online manner with no need to store data points in memory. Unlike many existing approaches to concept drift detection, our method allows the rate of false positive detections to be controlled and kept constant over time.
منابع مشابه
Robust economic-statistical design of the EWMA-R control charts for phase II linear profile monitoring
Control charts are powerful tools to monitor quality characteristics of services or production processes. However, in some processes, the performance of process or product cannot be controlled by monitoring a characteristic; instead, they require to be controlled by a function that usually refers as a profile. This study suggests employing exponentially weighted moving average (EWMA) and range ...
متن کاملMixed Exponentially Weighted Moving Average-Cumulative Sum Charts for Process Monitoring
The control chart is a very popular tool of statistical process control. It is used to determine the existence of special cause variation to remove it so that the process may be brought in statistical control. Shewhart-type control charts are sensitive for large disturbances in the process, whereas cumulative sum (CUSUM)–type and exponentially weighted moving average (EWMA)–type control charts ...
متن کاملFuzzy exponentially weighted moving average control chart for univariate data with a real case application
Statistical process control (SPC) is an approach to evaluate processes whether they are in statistical control or not. For this aim, control charts are generally used. Since sample data may include uncertainties coming from measurement systems and environmental conditions, fuzzy numbers and/or linguistic variables can be used to capture these uncertainties. In this paper, one of the most popula...
متن کاملThe Detection of Shifts in Autocorrelated Processes with Moving Range and Exponentially-Weighted Moving Average Charts
The objective of this research is to select the appropriate control charts for detecting a shift in the autocorrelated observations. The autocorrelated processes were characterized using AR (1) and IMA (1, 1) for stationary and non-stationary processes respectively. A process model was simulated to achieve the response, the average run length (ARL). The empirical analysis was conducted to quant...
متن کاملAn EWMA p Chart Based On Improved Square Root Transformation
Generally, the traditional Shewhart p chart has been developed by for charting the binomial data. This chart has been developed using the normal approximation with condition as low defect level and the small to moderate sample size. In real applications, however, are away from these assumptions due to skewness in the exact distribution. In this paper, a modified Exponentially Weighted Moving Av...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Pattern Recognition Letters
دوره 33 شماره
صفحات -
تاریخ انتشار 2012